Rushing to Judgement: How do Laypeople Rate Caller Engagement in Thin-Slice Videos of Human-Machine Dialog?
نویسندگان
چکیده
We analyze the efficacy of a small crowd of naı̈ve human raters in rating engagement during human–machine dialog interactions. Each rater viewed multiple 10 second, thin-slice videos of non-native English speakers interacting with a computerassisted language learning (CALL) system and rated how engaged and disengaged those callers were while interacting with the automated agent. We observe how the crowd’s ratings compared to callers’ self ratings of engagement, and further study how the distribution of these rating assignments vary as a function of whether the automated system or the caller was speaking. Finally, we discuss the potential applications and pitfalls of such a crowdsourced paradigm in designing, developing and analyzing engagement-aware dialog systems.
منابع مشابه
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are the central core of Machine Translation (MT) engines as they are developed based on frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages is still under question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
متن کاملMachine Learning Techniques for the Evaluation of a Chronic Disease-Management Dialog System
Computer-based dialog systems streamline simple data collection tasks over the telephone, but need continuous monitoring to assess their performance and, more importantly, whether callers are able to interact smoothly with them. Computergenerated call reports can be gathered readily; however, don’t measure clearly the successfulness of a given dialogue. Good caller satisfaction measures are onl...
متن کاملA Handsome Set of Metrics to Measure Utterance Classification Performance in Spoken Dialog Systems
We present a set of metrics describing classification performance for individual contexts of a spoken dialog system as well as for the entire system. We show how these metrics can be used to train and tune system components and how they are related to Caller Experience, a subjective measure describing how well a caller was treated by the dialog system.
متن کاملA Portable Auto Attendant System With Sophisticated Dialog Structure
An attendant system connects the caller to the party he/she wants to talk to. Traditional systems require the caller to know the full name of the party. If the caller forgets the name, the system fails to provide service for the caller. In this paper we propose a portable Auto Attendant System (AAS) with sophisticated dialog structure that gives a caller more flexibility while calling. The call...
متن کاملP125: How Do Achievement Goals Affect Statistics Anxiety? The Mediation of Academic Engagement
Statistics anxiety has a positive relationship with academic drop-out and deserves greater attention. This study examines a model explaining the impact of students’ achievement goals on their statistics anxiety through levels of their academic engagement. Three hundred and fifteen undergraduate students (53 males and 262 females) who studied educational sciences and psychology in the state univ...
متن کامل